Speech formant frequency and bandwidth tracking using multiband energy demodulation
نویسندگان
چکیده
In this paper, the amplitude and frequency AM–FM modulation model and a multiband demodulation analysis scheme are applied to formant frequency and bandwidth tracking of speech signals. Filtering by a bank of Gabor bandpass filters is performed to isolate each speech resonance in the signal. Next, the amplitude envelope AM and instantaneous frequency FM are estimated for each band using the energy separation algorithm ESA . Short-time formant frequency and bandwidth estimates are obtained from the instantaneous amplitude and frequency signals; two frequency estimates are proposed and their relative merits are discussed. The short-time estimates are used to compute the formant locations and bandwidths. Performance and computational issues of the algorithm are discussed. Overall, multiband demodulation analysis MDA is shown to be a useful tool for extracting information from the speech resonances in the time–frequency plane. © 1996 Acoustical Society of America.
منابع مشابه
A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation
In this paper, a method using dynamic programming to predict frequency boundaries is proposed for the joint demodulation of amplitude modulation (AM) and frequency modulation (FM) for speech signals. Because of the existence of modulations in speech signal, an algorithm called energy separation algorithm (ESA) has been developed to track the energy needed by a source to produce the speech signa...
متن کاملInstantaneous Energy Operators : Applications To
The nonlinear energy operator (x) _ x] 2 ? x x and its discrete-time counterpart have found numerous applications including development of the energy separation algorithm (ESA) for demodulat-ing AM-FM signals, tracking speech modulations, and detecting various events in nonstationary signals. In this paper we rst present some improvements on the energy operator and ESA when applied to demodulat...
متن کاملA multimodal density function estimation approach to formant tracking
We address the problem of robust formant tracking in continuous speech. We propose the robust statistical model of t-distribution mixture density (tMM) operating on the “pyknogram” obtained through a multiband AM-FM demodulation technique. The statistical model of the pyknogram is shown to be more-effective to handle the variability in the signal processing stage. The t-mixture density estimati...
متن کاملSpeech analysis and synthesis using an AM-FM modulation model
In this paper, the AM{FM modulation model is applied to speech analysis, synthesis and coding. The multiband demodulation pitch tracking algorithm is proposed that produces smooth and accurate fundamental frequency contours. The AM{ FM modulation vocoder represents speech as the sum of resonance signals modeled by their amplitude envelope and instantaneous frequency signals. E cient modeling an...
متن کاملTracking Formant Trajectory of Continuous Chinese Whispered Speech with Hidden Dynamic Model Based on Dynamic Target Orientation
Aimed at the characteristics of Chinese whispered speech formants, i.e., migrating to highfrequency, increased bandwidth, and increased spurious peaks and merged peaks, a method of tracking the formant trajectory of continuous Chinese whispered speech using the Hidden Dynamic Model (HDM) with dynamic target orientation was put forward in this study. The calculation proceeded as follows: firstly...
متن کامل